Read Me
Replication Information for 
How Migrating Overseas Shapes Political Preferences: Evidence from a Field Experiment

This Replication file contains the following items:

Quant_Rep.R is the main replication code for all results related to the project / RCT itself. The replicator can set the working directory and add a folder labeled �Figures� to the directory folder, and all the figures in the text and appendix will appear in the figures folder.

The main replication script uses four data sources:
1) BaselineMidline_Cleaned.csv contains the data from the baseline (pre-treatment) survey of the program registrants in September 2018 and the midline (post-selection, pre-migration) survey in January-March 2019. It is a csv dataset with text responses. If you have any questions about the ordering of these responses, please let us know.
2) Endline_Cleaned.sav contains the data from the endline (post-treatment) survey of the program registrants in January-March 2021. It is an SPSS dataset with ordered factor responses, which allows you to see both the text of the responses and the order (if you use as.numeric() in R).
3) Household_Cleaned.sav contains the data from the endline (post-treatment) survey of one family member per program registrant in March-May 2021. It is an SPSS dataset similar to the endline dataset. 
4) wvs_2.dta contains a few variables from the World Values Survey (Wave 7,2017-2022) that are used in this analysis comparing to the main data.

In addition, we have four other replication scripts that create Figures/Tables using data from secondary datasets and a qualitative dataset. These are: 

1) KMS_rep.R is the replication script for all results related to the Kerala Migration Survey of 2013.

kms_emigrant_rep.csv and kms_individual_non_migrant_rep.csv are the datasets from the KMS. The datasets contain information on the sex, age, education, and religion of migrants and non-migrants from Kerala. In addition, the kms_emigrant_rep.csv file contains information on the destination countries of migrants from Kerala.

2) ihds_rep.R is the replication script for all results related to data from the Indian Human Development survey.

ihds_replication.csv is a dataset from the Indian Human Development Survey II conducted in  2011-2012. The dataset provides information on the number of years migrants have spent overseas, individuals' sex, age, education, and religion. 

3) wvs_replication.R is the replication script for most results related to the World Values Survey and Varieties of Democracy index.

wvs.rds which is a combination of the World Value Survey (Wave 7, 2017-2022) and Varieties of Democracy (V-dem, v13). This .rds file contains information about respondents' country of birth, country when was surveyed, immigration status, intergroup tolerance, intergroup trust, and the regime type of the host country. 

Codebook.pdf explains all variables in all datasets.


Data Sources:

Zachariah, Kunniparambil Curien, and S. Irudaya Rajan. "Kerala migration study 2014." Economic and Political Weekly (2016): 66-71.

Desai, Sonalde, Reeve Vanneman and National Council of Applied Economic Research. India Human Development Survey-II (IHDS-II), 2011-12. Inter-university Consortium for Political and Social Research [distributor], 2018-08-08. https://doi.org/10.3886/ICPSR36151.v6

Haerpfer, C., Inglehart, R., Moreno, A., Welzel, C., Kizilova, K., Diez-Medrano J., M. Lagos, P. Norris, E. Ponarin & B. Puranen (eds.). 2022. World Values Survey: Round Seven - Country-Pooled Datafile Version 5.0. Madrid, Spain & Vienna, Austria: JD Systems Institute & WVSA Secretariat. doi:10.14281/18241.20

Coppedge, Michael, John Gerring, Carl Henrik Knutsen, StaffanI Lindberg, Jan Teorell, David Altman, Michael Bernhard, Agnes Cornell, M. Steven Fish, Lisa Gastaldi, Haakon Gjerl�w, Adam Glynn, Sandra Grahn, Allen Hicken, Katrin Kinzelbach, Kyle L. Marquardt, Kelly McMann, Valeriya Mechkova, Anja Neundorf, Pamela Paxton, Daniel Pemstein, Oskar Ryd�n, Johannesvon R�mer, Brigitte Seim, Rachel Sigman,Svend-Erik Skaaning, Jeffrey Staton, Aksel Sundstr�m, Eitan Tzelgov, Luca Uberti, Yi-ting Wang, Tore Wig, and Daniel Ziblatt. 2023. "V-DemCodebookv13" Varieties of Democracy (V-Dem) Project.


The main analysis was conducted using:
R version 4.3.1
RStudio version 2024.04.1
32-GB RAM computer

Package versions:
   packages            
 [1,] "ggplot2"   "3.4.3" 
 [2,] "ggstance"  "0.3.6" 
 [3,] "gdata"     "2.19.0"
 [4,] "gridExtra" "2.3" 
 [5,] "stargazer" "5.2.3" 
 [6,] "haven"     "2.5.3" 
 [7,] "foreign"   "0.8.84"
 [8,] "ivreg"     "0.6.2" 
 [9,] "lmtest"    "0.9.40"
[10,] "sandwich"  "3.0.2" 
[11,] "exr"       "0.1.0" 
[12,] "devtools"  "2.4.5" 
[13,] "xtable"    "1.8.4" 
[14,] "egg"       "0.4.5" 

      packages            
[1,] "DeclareDesign"   "1.0.4" 
 [2,] "tidyverse"       "2.0.0" 
 [3,] "kableExtra"      "1.3.4" 
 [4,] "sf"              "1.0.14"
 [5,] "magrittr"        "2.0.3" 
 [6,] "rio"             "1.0.1" 
 [7,] "stargazer"       "5.2.3" 
 [8,] "lfe"             "2.9.0" 
 [9,] "car"             "3.1.2" 
[10,] "scales"          "1.2.1" 
[11,] "ggthemes"        "4.2.4" 
[12,] "lubridate"       "1.9.3" 
[13,] "survminer"       "0.4.9" 
[14,] "survival"        "3.5.5" 
[15,] "splitstackshape" "1.4.8" 
[16,] "gridExtra"       "2.3" 
[17,] "knitr"           "1.44"
[18,] "modelsummary"    "1.4.2" 
[19,] "wesanderson"     "0.3.6" 
[20,] "janitor"         "2.2.0" 
[21,] "ggprism"         "1.0.4" 
[22,] "sandwich"        "3.0.2" 
[23,] "lmtest"          "0.9.40"
[24,] "ri2"             "0.4.0" 
[25,] "ggplot2"         "3.4.3" 
[26,] "RColorBrewer"    "1.1.3" 
[27,] "xtable"          "1.8.4" 
[28,] "texreg"          "1.38.6"
[29,] "dplyr"           "1.1.3" 
[30,] "tidyr"           "1.3.0" 
[31,] "reshape2"        "1.4.4" 
[32,] "Hmisc"           "5.1.1" 
[33,] "estimatr"        "1.0.0" 
[34,] "patchwork"       "1.1.3" 
[35,] "haven"           "2.5.3" 
[36,] "scales"          "1.2.1" 
[37,] "countrycode"     "1.5.0" 
[38,] "schoolmath"      "0.4.2" 
[39,] "Rmisc"           "1.5.1"

"ggpubr" "0.6.0"
